Intrinsic Mean Square Displacements in Proteins 
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The thermal mean square displacement (MSD) of hydrogen in proteins and its associated hydra- 
tion water is measured by neutron scattering experiments and used an indicator of protein function. 
The observed MSD as currently determined depends on the energy resolution width of the neutron 
scattering instrument employed. We propose a method for obtaining the intrinsic MSD of H in the 
proteins, one that is independent of the instrument resolution width. The intrinsic MSD is defined 
as the infinite time value of (r 2 ) that appears in the Debye-Waller factor. The method consists of 
fitting a model to the resolution broadened elastic incoherent structure factor or to the resolution 
dependent MSD. The model contains the intrinsic MSD, the instrument resolution width and a rate 
constant characterizing the motions of H in the protein. The method is illustrated by obtaining 
the intrinsic MSD (r 2 ) of heparan sulphate (HS-0.4), Ribonuclease A and Staphysloccal Nuclase 
(SNase) from data in the literature. 



I. INTRODUCTION 

Following pioneering experiments and subsequent de- 
velopments, the mean square displacement (MSD) of hy- 
drogen in proteins can now be readily observed in neutron 
scattering experiments (H4Tl| . Specifically, the global av- 
erage MSD of H throughout the protein is typically de- 
termined from the elastic component of the incoherent 
dynamic structure factor (DSF). In proteins at low tem- 
perature, the MSD is small. As temperature is increased 
the MSD increases and often goes through a marked in- 
crease at a specific temperature or temperatures, To, 
denoted the dynamical transition. 0-0, fl3 - [l6| The on- 
set of large values of MSD are associated with the onset 
of function in proteins. 0, [17H22J Essentially, the large 
amplitude MSD enables contact between different parts 
of the protein which promotes chemical activity, function 
and possible folding. A large MSD is used as an indicator 
that function in a protein is possible. 

In most current methods of data analysis, the MSD 
and Tjj extracted from experiment depend on the en- 
ergy resolution of the neutron scattering instrument em- 
ployed. Different MSD (r 2 ) exp are extracted from data 
observed on different instruments. The higher the res- 
olution, the larger that apparent (r 2 ) exp observed and 
the lower the apparent Try observed. A high energy res- 
olution instrument is needed to observe all the motions 
that contribute to the MSD, including the slow, long time 
motions. Indeed, measurements in the same protein at 
the same hydration level on different instruments have 
been made explicitly to demonstrate the dependence of 
(r 2 )exv on the instrument resolution. (23 25] 



Specifically, the observed (r 2 ) exp is typically obtained 
from the observed, resolution broadened elastic incoher- 
ent DSF, 0(Q, u> — 0), as a function of wave vector trans- 
fer, Q, as, 



/ exp 



= -3- 



dlnO exp (Q, uj = 0) 



(1) 



Examples of this (r 2 



dQ 2 

obtained on different instru- 



ments which displays the dependence of ( T ) exp in the 
instrument energy resolution width are shown in Fig. [TJ 

The goal of the present paper is to extract the intrin- 
sic MSD, (r 2 ), from the neutron scattering data. This 
is the intrinsic MSD independent of the instrument res- 
olution width, W. The intrinsic MSD is defined here as 
the (r 2 (t — oo)) = (r 2 ) value of the MSD that appears 
in the full Debye-Waller factor. The intrinsic MSD in- 
cludes motions up to t — oo. It is the MSD that would 
be observed on an instrument with zero energy resolution 
width, W -> 0. 

To obtain (r 2 ) from data, a simple model of the ob- 
served incoherent DSF, 0(Q,uj), which includes the (r 2 ), 
the resolution width W and a simple description of the 
motional processes with a single relaxation parameter, 
A, is developed. The simple model for the normalized, 
resolution broadened 0(Q,lu — 0) is, 



O n (Q,oj = 0) + (l-/oo) 



W 



(W + A) 



.4 



where 1^ is the Debye-Waller factor, 



loo = exp(--( 



(2) 



(3) 



and A is an additive constant included if the data in- 
cludes a constant. Eq. ([3]) is the definition of the intrinsic 
(r 2 ). The model is fitted to the observed 0(Q, uj = 0) for 
a given W with (r 2 ) and A treated as free parameters to 
be determined by best fit. An intrinsic (r 2 ) independent 
of W is obtained from fits to existing data in this way. 

As in experiment, Eq. (1), a MSD (r 2 } s [ ope obtained 
from the slope of the model 0(Q,uj = 0) given by, 



) slope 



t dhiO N (Q,uj = 0) 



(r 2 )/[l 



dQ 2 
W } 

loo A 



(4) 



can be introduced. As with (r 2 ) exp , this (r 2 ) s i ope de- 
pends on the instrument resolution width W, specifically 
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FIG. 1: MSD of H in proteins observed on neutron scattering 
instruments having different energy resolution widths, W: (a) 
in hydrated heparan sulphate (HS-0.4) observed by Jasnin et. 
al[H] on IN16 [W = 1 fieV), IN13 {W = 10 fieV) and IN6 
(W = 100 /xeV), (b) in hydrated Ribonuclease A observed 
by Wood et. alQ on IN16 (W = 1 ^eV) and IN5 (W = 
100 /ieV), and (c) in hydrated Staphysloccal Nuclase (SNase) 
observed by Nakagawa et. al'25 ! ] on INfO (W = 1 ^eV), 
IN13 (W = 10 /ieV), HER {W = 100 fieV) and GP-TAS 
(W = 1 meV). 



on the ratio W/X. The expression for (r 2 ) s i ope cor- 
responds to the (r 2 ) exp and can also be fitted to the 
observed values of (r 2 ) exp such as shown in Fig. [TJ to 
obtain the intrinsic (r 2 ). The intrinsic (r 2 ) is always 
greater than (r 2 ) s ; ope . The intrinsic Td is defined as the 
temperature at which the intrinsic (r 2 ) shows a marked 
increase with increasing temperature. There can be more 
than one Tr,[ll,[l|. 

In the following section, we develop the model incoher- 
ent DSF. In section 3, the model is fitted to observed val- 
ues of the elastic incoherent DSF found in the literature 
to obtain the intrinsic (r 2 ) and To in three proteins. 
The intrinsic (r 2 ) and other parameters are discussed 
in section 4, where suggestions for making the present 
simple, illustrative model more sophisticated are made. 



II. FORMULATION OF THE MODEL 

A. Dynamical Structure Factor 

Neutrons incident on proteins interact and scatter pre- 
dominantly from the hydrogen nuclei in the proteins and 
in the associated hydration water. The hydrogen nu- 
cleas, the proton, has a large, incoherent scattering cross- 
section for neutrons, 82 barns, which dominates all oth- 
ers. Neglecting the scattering from the electrons and 
other nuclei, the observed scattering intensity is pro- 
portional to the incoherent dynamical structure factor 
(DSF), S mc (Q,uj) of the H, 



1 f°° 

Sine(Q,u) = — / dtexp(iujt)I inc (Q 7 t), 

27T J~oc 



where 



1 N 

I m c{Q,t) = —^2(e^p(-iQ.ri(t))exp(iQ.ri(0))) 



(5) 



(G) 



is the intermediate incoherent DSF. In Eq. (|S|), HQ and 
hu are the momentum and energy, respectively, trans- 
ferred from the neutron to the protein in the scattering. 

The neutrons scatter from the H at points fj (t) in the 
protein. The incoherent DSF is proportional to an av- 
erage over the N self correlation functions of the (t) of 
each individual H. The H is distributed in very different 
environments throughout the protein and these correla- 
tion functions may be quite different with different time 
scales (26l - l28| . Our first approximation is to represent this 
average over all H by a single representative H so that 
the IinciQit) reduces to 



I(Q,f) = (exp(-iQ.r(t))exp(iQ.r(0))>. 



(7) 



This self correlation function of r(t) represents a weighted 
distribution over correlation functions and will contain 
all time scales, short and long, of H throughout the pro- 
tein. We also drop the incoherent "inc" from S(Q, oj) and 



3 



I(Q,t) from now on. Eq. (JT]) is the usual approximation 
made in the analysis of experimental data. 

The neutron instrument has an energy resolution func- 
tion, denoted here by R(lu). A perfect energy resolution 
would be R(u>) = S(uj), no resolution width. A represen- 
tation of an R(ui) of finite width W is 

a Lorentzian function, which has Fourier transform 
R(t) — exp(— Wt). We will also use a Gaussian R(Q,oj). 
The observed function is a convolution of S(Q,lo) and 
R(w), 



of the form of I(Q, t) and R(t). Only for infinitely sharp 
resolution, W = 0, R(t) = 1 and R(uj) — S(uj), are all 
long time motions in I(Q,t) observed in 0(Q 7 uj — 0). 

In experiments, it is often assumed that the ob- 
served 0(Q,w — 0) is given by O exp (Q,oj = 0) = 
cxp[— iQ 2 (r 2 )]. The experimental (r 2 ) exp is then ob- 
tained as (r 2 ) exp — —3d\nO exp (Q 1 uj — 0)/dQ 2 as in 
Eq. ©• 

Since from Eq. (fl"2]) the observed 0(Q,oj = 0) depends 
on the instrument resolution width W, this (r 2 ) exp will 
depend on the instrument resolution. 



B. The Model 



/oo 
duj'S{Q,uj')R(u)-uj') (9) 
- OO 

i r°° 

= — / dtexp(iojt)I(Q,t)R(t) (10) 

27r J-oo 

i r 00 

0{Q.u) = — / dtexp(^)/(Q,i)exp(-ITi)(ll) 

27r J-oo 

Commonly measured is the elastic (zero energy trans- 
fer, w = 0) component of the incoherent 0(Q,ui), 

1 f°° 

0{Q,uj = Q) = — dtI(Q,t)exp(-Wt). (12) 



TABLE I: The resolution width and the resolution time of neutron 
scattering instruments. 



Instrument 


Energy Resolution 
OieV) 


w 

(THz) 


Time Scale 

(PS) 


IN16 


~1 


0.00025 


~4000 


IN 10 


~1 


0.00025 


-4000 


IN13 


~10 


0.0025 


~400 


IN6 


~100 


0.025 


~40 


IN5 


~100 


0.025 


~40 


HER 


~100 


0.025 


~40 


GP-TAS 


~1000 


0.25 


^4 



In 0(Q,uj = 0), we see that the role of a finite res- 
olution width is to cut off the integrand after at time 
t ~ IT -1 so that long time processes in I(Q,t) are not 
observed in 0(Q, uj = 0). The higher the instrument reso- 
lution, the longer the time in I(Q, t) that can be observed 
in 0(Q,u — 0). This is completely general independent 



The aim is to construct a simple model for I(Q,t) 
which we can fit to data to obtain the intrinsic value 
of (r 2 ) that includes the contributions from all motions, 
including long time motions. 

We begin by separating I(Q, t) into a t = oo (1^ — 
I(Q,t = oo)) and a time dependent part, 

I(Q,t) = I 00 + {I(Q,t)-I 00 ). (13) 

where, from Eq. ([7]), 

loo = HQ, t = oo) = (exp(-iQ.r(oo)) exp(iQ.r(0))) 
= (expHQ.r(oo)))(exp(iQ.r(0))) 
= (expHQ.r(0)))(exp(tQ.r(0))) 

= ex P (-iQV)+0(Q 4 )) (14) 

is the infinite time limit. To obtain the last expression 
we assume: (1) that r(oo) and r(0) are completely un- 
corrected so that the averages of them are independent, 
(2) that the system is translationally invariant in time 
(no CM motion) so that r(oo) = r(0) and (3) that in a 
cumulant expansion of (exp(— iQ.r)), cumulants beyond 
the second cumulant are negligible. The latter is valid if 
Q is small or if the distribution over r is approximately 
a Gaussian distribution. The cumulants beyond the sec- 
ond vanish exactly for all Q if the distribution over r is 
exactly Gaussian. 

We take (r 2 ) in I(Q, oo) in Eq. fpT4|) as the definition of 
the intrinsic, long time value of (r 2 ) in the protein that 
includes all motional processes. It is the (r 2 ) that would 
be observed with an infinitely high resolution instrument, 
R(w) = S(u>). That is, loo = exp(-Q 2 (r 2 )/3) is the 
definition of the intrinsic (r 2 ) as in Eq. ([3]). 

The time dependent part of I(Q, t) has the limits 

l'(Q,t)=I(Q,t)-I 00 = \ 1_/o ° * = ° 

10 t = oo 

We model this by the function 

l'{Q,t) = {l-I 00 )C{t) (15) 
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FIG. 2: Upper frame: Elastic DSF, On(Q,u = 0), of H in heparan sulphate observed by Jasnin et.al[2J] (open circles) and fit 
of model Eq. (T2J to the observed On(Q,u = 0). Lower frames: the best fit values of the intrinsic (r 2 ) and A. The solid line is 
a guide to the eye for (r 2 ), the same line for all three instruments. 



where C(t) has the limits C(t = 0) = 1, C(t = oo) = 0. 
An example is 

C(t) = exp(-Ai), (16) 

where A represents the decay constant of correlations in 
the protein. The model is 

I(Q,t)=I QO (Q) + (l-I 00 (Q))C(t). (17) 

which is constructed to have the correct limits at t = 
and t = oo and to have a plausible representation of a 
single motional decay process, for example diffusion, at 
intermediate times. 

Substituting the model I(Q,t) in 0(Q,u — 0) the ob- 
served elastic function is 

O(Q,u = 0) = — I dtI(Q,t)exp(-Wt) 

The data is usually presented as the "normalized" 
0(Q,u) = 0), the 0(Q,uj = 0) above divided by 0(Q = 



0,uj = 0) at Q = 0. From Eq. ([TH]), the model 0(Q = 
0,u = 0) = {nW)- 1 and O n (Q,uj = 0) is given by Eq. 
(2). A constant A is added to Eq. © since the data 
for different Q values are sometimes separated from one 
another in a figure by a constant for clarity. 

The On(Q,u = 0) is the simple model that we fit to 
data for a given W. The (r 2 ), A and A are treated as 
free fitting parameters to be determined by the best fit 
to data. The parameter A plays no role. In this way 
we determine the intrinsic long time value of (r 2 ) and A. 
The intrinsic (r ) should be independent of W. It can be 
compared with the experimental values {r 2 ) exp which are 
obtained from the slope of the observed On(Q,u = 0) 
using (12) and which are resolution dependent. Eq. @ 
is the initial simple model of On(Q,uj = 0) we use in 
section 3 to test the method. 

We can also obtain a value of the MSD using the slope 
expression, Eq. |T]), and our model On(Q,uj = 0) of 
Eq. ([21). This value, as with (r 2 ) exp , will depend on 
the instrument resolution W since On(Q,oj = 0) de- 
pends on W . The model MSD obtained from the slope, 
(r 2 ) s i p e — — 3dlnON(Q, w = Q)/dQ 2 and differentiating 
Eq. is given by Eq. We see immediately that 

(r 2 ) slope depends on the ratio of W/X. If W <C A, then 
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the instrument resolution is high enough to catch all the 
decay of I(Q, t) to I x and (r 2 ) s i ope — > (r 2 ), the intrinsic 
and full (r 2 ). If the fit of On{Q, w = 0) to data is good, 
so that A and (r 2 ) in Eq. Q are well determined, then 
we expect the (r 2 ) s i ope 
well. 



to reproduce the observed 



III. RESULTS 

In this section we fit our model of the observed, res- 
olution broadened elastic incoherent DSF On(Q,u = 0) 
given by Eq. ([2]) to data in the literature. The goal is to 
determine the intrinsic MSD (r 2 ) of H in specific proteins 
and obtain a value for the relaxation parameter A which 
describes the approach of (r 2 ) to its long time, intrinsic 
value. 



A. The MSD 

The top frame of Fig. 2 shows a fit of the model 
On(Q,u = 0) to the normalized 0(Q,uj = 0) observed 
in heparan sulphate (HS-0.4) by Jasnin et. al f24j . 
0(Q,uj — 0) is observed on three instruments which have 
different energy resolutions, W — 1, 10 and 100 /xeV. 
There is data for 0(Q,ui = 0) at seven temperatures, 
the lowest temperature at the top. The fits are generally 
good but some data points lie off the fitted line. The mid- 
dle and bottom frames in Fig. 2 show the values of (r 2 ) 
and A in the model On(Q,u — 0) that give the best fit. 
The best fit (r 2 ) increases with temperature, markedly 
for temperatures above T ~ 230 K. There is some scat- 
ter in the (r 2 ) which arises from the uncertainty in the 
fit. The scatter or uncertainty of A is particularly large 
showing that the data is relatively insensitive to the value 
of A. Conversely, we can say that the data is not very 
discriminating or sensitive to the model employed for the 
time dependence of C{t). However, importantly, the (r 2 ) 
emerging from the fit is independent of the resolution, 
W. The solid black line in the middle frame of Fig. 2 
is a guide to the eye through all the (r 2 ), the same line 
for all instruments. This shows that while there is sub- 
stantial fluctuations in the individual values of (r 2 ), the 
average (r 2 ) obtained is independent of the instrument 
resolution W. 

Similarly, Fig. [3] compares the intrinsic (r 2 ), the model 



/ slope 



and the observed 



/ exp 



in HS-0.4. The solid 



r ), the same line for all three instruments. The 
exp is the MSD obtained by Jasnin et. al from the 



points are the intrinsic (r ) emerging from the fits in Fig. 
[5] and the solid line is again a guide to the eye through 
these 

(r% 

slope of their data using Eq. (JXJ) . The (r 2 ) s i ope is ob- 
tained from the slope of the model 0(Q, u> = 0) given by 
Eq. (HI). If the fit to the data is precise, the (r 2 ) s / ope and 
(r 2 )exp should agree. This is the case for IN13 and IN6 
but less so for the IN16. When the instrument resolu- 
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FIG. 3: MSD, (r 2 ) exp , in hydrated HS-0.4 observed Jasnin et. 
al[2j] on IN16, IN13 and IN6 (open squares). The intrinsic 
(r 2 ) (solid circles) are the intrinsic MSD obtained by fitting 
the model Eq. J2]) to data shown in Fig. [2] The solid line 
is a guide to the eye through the intrinsic (r 2 ), the same for 
all instruments. The {r 2 ) s i ope (open circles) are the MSD 
calculated from Eq. Q which should be similar to {r 2 ) exp . 



intrinsic (r 2 ). This is the case for IN16 where (r 2 ) s i ope 
(open circles) lie on top of the (r 2 ) (black dots). For 

differ somewhat for T > 



tion W is small, we expect (r 



IN16, the (r 2 ) s i ope and (r 2 



1 slope 



to coincide with the 250 K. This indicates that there is not a good fit at higher 
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FIG. 4: Upper frame: Elastic DSF, On(Q, u = 0), observed by Wood et. al[23l] on inl6 and IN5 (open circles) and fit of model 
Eq. ([2]) to the observed On(Q,w = 0). Lower frames: the best fit values of (r 2 ) and A. The solid line is a guide to the eye to 
the best fit {r 2 }, the same line for both instruments. 



temperatures, as can be seen in the upper frame of Fig. [2] 
The essential point of Fig. [3] is that the intrinsic (r 2 ) is 
independent of W and that (r 2 ) shows a marked increase 
at temperature, Trj ~ 230 K, the intrinsic dynamical 
transition temperature of HS-0.4. Similarly, the upper 
frame of Fig. 2] shows the fit of the model On(Q,u> = 0) 
of Eq. ^/ to the measured 0^(Q, oj = 0) in Ribonuclease 
A observed by Wood et. al[23|]. The fits are good except 
that there is significant scatter in the data taken on IN5. 
The best fit values of (r 2 ) and A are shown in the lower 
frames of Fig. [4] Again the best fit values of the intrinsic 
(r 2 ) is independent of the instrument energy resolution 
W, as seen from the solid line, a guide to the eye to the in- 
trinsic (r 2 ), the same for both instruments. In Fig.[5j the 
intrinsic (r 2 ), the solid line, is compared with the resolu- 
tion dependent observed (r 2 ) exp obtained from the slope 
of the data with Q 2 . While the (r 2 ) exp is very different for 
the two instruments displaying the dependence of (r 2 ) exp 
on the resolution width W, the (r 2 ) exp observed on IN16 
agrees well with the intrinsic (r 2 ). This indicates that 
(r 2 ) in Ribonuclease A reaches its final, equilibrium value 
within a time period observable on IN16 (t ~ 4 nanosec- 
onds). In a similar way, we fitted the model Eq. @ to 
values of On(Q,u = 0) observed by Nakagawa et. al[25j 
in Staphysloccal Nuclase (SNase) on four different instru- 
ments. The resulting intrinsic (r 2 ) obtained from the fits 
are shown in Fig. [S] as solid dots for each instrument 
with a solid line (guide) through the dots, the same line 
for all four instruments. Although there is substantial 



scatter in the (r 2 ), an intrinsic (r 2 ) independent of the 
instrument resolution width W can be obtained by fit- 
ting the model to the observed Ojv(Q,w = 0). To test 
the sensitivity of the results to the model of C(t) and to 
the shape of the instrument resolution (IR), we changed 
C(t) to Gaussian, C(t) = exp(-A 2 i 2 /2), and the IR to 
Gaussian, R(oj) = enp(-uj 2 /2W 2 )/V2itW 2 . With both 
C(t) and R(ui) a Gaussian, an analytic expression for 
On(Q,u = 0) can again be obtained. Fig. [7] shows the 
resulting intrinsic (r 2 ) obtained from fitting the Gaus- 
sian model to the data of Nakagawa et. al. The (r 2 ) in 
Figs. © and ([7J are barely distinguishable. This shows 
that intrinsic (r 2 ) obtained is not sensitive to the form 
of C(t) and R(lj) used in the model at the present level 
of precision of the data. 



B. The Dynamical Transition Temperature, To 

In this subsection, we consider the apparent depen- 
dence of the dynamical transition temperature on the 
instrument energy resolution W '. The upper frame of 
Fig. |S] shows the (r 2 ) exp of H in glutomate dehydroge- 
nase observed on IN16 and IN6. The (r 2 ) exp is obtained 
as usual from the slope of the observed O exp (Q,uj = 0) 
with Q 2 as given by Eq. ([1]). In this example, the dy- 
namical transition temperature, Trj, the temperature at 
which (r 2 ) exp increases markedly with temperature, ap- 
pears to depend on the neutron instrument used. On 
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FIG. 5: MSD, (r 2 ) e xv, in hydrated Ribonuclease A observed 
by Wood et. al [23] on IN16 and IN5 (open squares). The 
intrinsic MSD (r 2 ) (solid circles) are obtained by fitting the 
model Eq. ([2]) to the data shown in Fig. [4] The solid line is 
a guide to the eye to the intrinsic MSD, the same for both 
instruments. The MSD (r 2 ) s i ope (open circles) is calculated 
from Eq. @. 



IN16, the apparent T D is T D ~ 150K while that on IN6 
is To ~ 230K. To determine the intrinsic (r 2 ) and To 
for this protein, we fitted the model (r 2 ) s i ope given by 
Eq. (jU) to the observed (r 2 ) exp in Fig. [3 Eq. (gj) is the 
model equivalent of Eq. (JTJ). Specially, at each temper- 
ature we determined the two parameters (r 2 ) and A by 
setting (r 2 ) s i ope in Eq. Q equal to the (r 2 ) exp of Fig. [8] 
for each instrument. The values of (r 2 ) s i ope actually fit- 
ted are shown as the solid lines through the data points 
in the upper frame. The resulting values of (r 2 ) and A 
are shown as the lower frames of Fig. [8l 

The intrinsic (r 2 ) shows two interesting features. 
Firstly, the intrinsic (r 2 ) is very similar to the (r 2 ) exp 
observed on IN16. The intrinsic T D is T D ~ 150 K. Sec- 
ondly, and unexpectedly, the (r 2 ) is found to decrease for 
T > 200 K and is lower at T ~ 250 K than at T = 200 K. 
This seems unphysical. To explore this effect, we arbi- 
trarily kept the intrinsic (r 2 ) constant at its 200 K value 



for temperatures T > 200 K. This (r 2 ) is shown in Fig. [9] 
The resulting {r 2 ) s i ope obtained when (r 2 ) is held con- 
stant for T > 200 K is shown as the solid line in Fig. |H1 
The (r 2 ) s i ope for W = 1 [xeV (IN16) continues to increase 
above T = 200 K but eventually reaches a plateau above 
250 K. This plateau of (r 2 ) at higher temperature appears 
to be unique for H in glutomate dehydrogenase or there 
is an issue with the sample or data on IN16 for T > 200 
K. Further study of this effect would be interesting. 

The chief result is that an intrinsic (r 2 ) and To inde- 
pendent of instrument resolution can be obtained. The 
intrinsic To is the temperature at which the intrinsic 
(r ) begins to increase markedly. In glutamate dehydro- 
genase, the intrinsic To is at To — 150 K in Fig. [3] This 
is very close to the To observed on IN 16 where W = 1 
[ieV. When W is larger, the apparent To seen in (r 2 ) exp 
is moved to higher temperature. The larger the W, the 
higher the apparent To- This finding is consistent with 
the results of Becker et aZ.[16(. Returning to the previ- 
ous subsection, we note that the data in Fig. |5] shows 
that the larger the W, the higher the temperature that 
is needed to observe a marked increase in (r 2 ) exp . That 
is, the apparent To of (r 2 ) exp in Fig. [5] is shifted to a 
higher temperature for larger W. In this way, the data 
shown in Figs. (JB]) and ((HJ are manifestations of the same 
phenomena. 



IV. DISCUSSION 

The goal of many measurements of quasielastic neu- 
tron scattering from proteins is to determine the ther- 
mal mean square displacement (MSD) of hydrogen in 
the protein and in its hydration water. The observed 
MSD, (r 2 ) exp , is generally obtained from the slope of 
the observed elastic incoherent DSF 0(Q,u = 0) using 
Eq. (jTJ). The 0(Q,uj = 0) is the resolution broadened 
DSF S(Q,w) at ui = 0. However, the (r 2 ) exp given by 
Eq. (P) is equal to the actual (r 2 ) of the protein only 
if the scattered intensity exactly at uj = is observed. 
This is the case only if the energy resolution width, W, 
of the instrument is zero so that 0(Q,uj = 0) reduces to 
S(Q,lu — 0). In this limit 0(Q,u = 0) is well approxi- 
mated by its time independent part 1^ given by Eq. , 
as seen from Eq. ((4]), and (r 2 ) exp equals (r 2 ). When W 
is finite the scattered intensity at finite uj around ui = is 
also observed in 0(Q, uj = 0). In this case the (r 2 ) exp ex- 
tracted from Eq. (fTJ depends on the resolution width W 
and is smaller than (r 2 ). More precisely, it depends on 
the ratio of W and to the rate constants that govern 
the motions in the protein. For a given protein, different 
{t )exp are observed on instruments having different res- 
olution widths W. The impact of a finite W is different 
in different proteins. 

The aim of the present paper is to provide a method 
in which the intrinsic (r 2 ) can be extracted from data 
taken on instruments that have different W. The essence 
of the method is to create a model of 0(Q,u> = 0) in 
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FIG. 6: MSD, (r 2 ) exp , of the hydrated SNase observed by Nakagawa et. al [U on IN10 (W = 1 fieV), IN13 (W = 10 /ueV), 
HER (W = 100 /ueV) and GP-TAS (W = 1000 /xeV)(open squares^ The intrinsic MSD (r 2 )(solid circle) is obtained by fitting 
the Eq. ((2j to the elastic intensity data observed Nakagawa et. al [2^] . The solid line is a guide to the eye to the intrinsic MSD, 
the same for all instruments. The MSD (r 2 ) s ; ope (open circles) is calculated from Eq. Q. 




Temperature (K) 



FIG. 7: MSD, (r 2 ) exp , of the hydrated SNase observed by Nakagawa et. al [H[ on IN10 (W = 1 /LteV), IN13 (W = 10 fxeY), 
HER (W = 100 ^eV) and GP-TAS (W = 1000 MeV)(open squares). The intrinsic MSD (r 2 )(solid circle) is obtained by fitting 
the observed DSF equation, where the Gaussian resolution and C(t) functions are used, to the elastic intensity data observed 
Nakagawa et. al [25|| . The solid line is a guide to the eye to the intrinsic MSD, the same for all instruments. The MSD 
{r 2 } sio P e(open circles) is calculated from the Q 2 dependence of the observed DSF. 
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FIG. 8: MSD, {r 2 )ex P , of glutamate dehydrogenase in 
CD3OD/D2O observed by Daniel et. al [H on IN16 (W = 
1 jUeV) (triangles) and IN6 (W = 100 fieV) (squares). Eq. © 
for (r 2 ) s i ope is fitted to the observed (r 2 ) exp and the fitted 
(r 2 ) slope is shown as a solid line. The lower frames show 
the intrinsic (r 2 ) and A obtained from the fit. The extracted 
intrinsic (r 2 ) decreases above T = 220 K. 



which the intrinsic, infinite time (r 2 ) appears explicitly 
and to fit the model to the observed 0(Q,uj — 0) to 
obtain (r 2 ). The model must also contain a descrip- 
tion of the motional processes and the resolution width 
W. We began with a simple model of the motions 
and a single, global Debye- Waller factor which led to 
Eq. @. By fitting the model to data taken on instru- 
ments that have different energy resolutions we showed 
that an instrument independent (r 2 ) can be obtained. 
The {r 2 } s i p e given by Eq. dU is the model equivalent 
of (r 2 )ex P - The model (r 2 ) s i ope can also be fitted to ob- 
served values of {r 2 ) exp to obtain the intrinsic (r 2 ) if 
the measured 0(Q,uj — 0) are not available. 

The dependence of the (r 2 ) exp obtained from the slope 
of 0(Q, ui = 0) with Q 2 using Eq. (TJJ, and its associated 
dynamical transition temperature, Tjj, on the instrument 
resolution width, W may be illustrated and clarified us- 
ing Eq. (0} for (r 2 ) s i p e . The (r 2 ) s i op e is the model equiv- 



FIG. 9: The MSD as in Fig. [8] In this case the intrinsic MSD 
(r 2 ) is arbitrarily held constant at temperatures T > 200K. 
The solid lines show the values of (r 2 ) s ; ope obtained for IN16 
and IN6 when the (r 2 ) is held constant for T > 200A'. The 
expected {r 2 ) B i ope continues to increase above 200 K. 



alent of {r 2 ) eX p given by Eq. (TTJ). Firstly, the intrinsic 
(r 2 ) is always greater than the observed (r 2 ) s iope at a 
given temperature. The larger is W, the further the ob- 
served (r 2 ) slope is suppressed below the intrinsic (r 2 ). 
Similarly, the intrinsic Tu obtained from the intrinsic 
(r 2 ) is always at a lower temperature than the observed 
To- The larger is W, the more the apparent To is shifted 
to higher temperature above the intrinsic Tjj. The de- 
pendence of (r 2 ) s i pe and Tu on W can be illustrated 
by plotting (r 2 ) s io P e for increasing values of W using by 
Eq. (01 as shown in Fig. [TD] As input, we use the in- 
trinsic (r 2 ) of glutamate dehydrogenase shown in Fig. [5] 
plus an extrapolation of this (r 2 ) to higher tempera- 
tures. A reasonable A similar to that Fig. |S]is also used. 
This intrinsic (r 2 ) has an intrinsic To — 150 K. From 
Fig- EE we see that at a given temperature, (r 2 ) s io P e cal- 
culated from Eq. ((4]) always lies below (r 2 ). The larger 
is W, the smaller is (r 2 ) s iope- Equivalently, we could say 
that (r 2 ) slope in Fig.nTj]is shifted to higher temperature. 
The larger the W, the further the (r 2 ) s ; ope is shifted to 
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FIG. 10: The resolution broadened MSD, (r 2 ) s i ope , for differ- 
ent resolution widths, W obtained from Eq.Q illustrating the 
dependence of the apparent Toon W. The intrinsic (r 2 )and 
A of glutamate dehydrogenase, obtained from the fits to data 
by Daniel et al. [23] shown in Fig. [S] are used in Eq. @ . 



higher temperature. These shifts may be described as 
a (r 2 ) s i p e that lies below (r 2 ) at a given temperature 
or a Td that is shifted to higher temperature than the 
intrinsic or actual Td- The reduction in (r 2 ) s i ope and 
the apparent increase in Td with increasing W are one 
and the same effect. 

The impact of a finite W can also be seen in I(Q,t) 
given by Eqs. ([T2]). (fl8]l and @. In Eq. ([12]). the inte- 
gral over time is cut off after a time t w = W^ 1 . If W is 
large, tw is short and long time motions/correlations in 
C(t) cannot contribute to the integral in Eq. (|12p . The 
0(Q, lj — 0) then contains only the shorter time motions 
and 0(Q,w = 0) depends on W. What is important is 
the ratio of W to A. If the intrinsic correlations in the 
protein (in C(t)) decay rapidly, A >> W, then all cor- 
relations in C(t) are included in Eq. (fl2t and the integral 
again becomes independent of W. Thus we expect the 
impact of finite resolution width W to vary from protein 
to protein. 

In any model used to extract the intrinsic (r 2 ), some 
characterization of the rate of decay of correlations rel- 
ative to W is needed. For example, in a White Noise 
model of correlations, C[t) — S(t), there are no correla- 
tions and any W will capture all correlations and yield 



the intrinsic 



In section 3, we saw that the (r 2 ) exp of 



HS-0.4 observed on IN16 (W = 1 /xeV) was close to the 
intrinsic (r 2 ). This means that all the correlations in 
C(t) of HS-0.4 are largely captured within a time scale 
W~ x — 4 ps. Whenever the observed (r 2 ) exp is close to 
the intrinsic (r 2 ), we may infer that the instrument reso- 
lution is small enough that all or most of the correlations 
contributing to C(t) have been captured in Eq. (fl"2"|) . 

The present model is very simple and is intended to 
be illustrative only. The model has several limitations. 



Firstly, we used a simple representation C(t) = exp(— At) 
to describe the relaxation of correlations in the protein. 
This C(t) is appropriate for a single diffusion process. 
The model could be improved by using a stretched ex- 
ponential which represents several diffusion mechanisms 
contributing to C(t). We found that this improved 
fits to simulations of I(Q,t) but generally the data for 
0(Q, ui) was not significantly precise to distinguish be- 
tween a simple and stretched exponential. There is also 
no representation of ballistic propagation or vibration in 
C(t). The A could also be Q dependent. In future appli- 
cations a more sophisticated C(t) could be incorporated. 

The present model uses only a single, global Debye- 
Waller factor 1^ in Eq. ©. This simplification is made 
by approximating Eq. © by Eq. ([7]). In a real protein, 
the H occupies a variety of positions that have differ- 
ent values of (r 2 ). If a distribution of (r 2 ) values is 
included 26-28J, there are terms in 1^ beyond a sim- 
ple Gaussian represented by a single parameter (r 2 ). In 
future applications a generalization to include a distribu- 



tion of 



values based on Eq. ([6]) should be incorpo- 



rated. 

Improvements to the resolution function used could 
also be made. Also, recent simulations suggest that some 
motions contribute to (r 2 ) only after long (picosecond) 
times. In applications of the present method to these 
proteins, the data from reasonably high resolution in- 
struments may be needed so that information on these 
longer time scale motions is included in the observed 
O(Q,w = 0). 



V. SUMMARY 

We have proposed a method for obtaining the intrinsic 
MSD of H in the proteins independent of the instrument 
resolution width, W . The method consists of fitting a 
model to the observed resolution dependent data or res- 
olution dependent MSD. The model contains the intrinsic 
MSD, the instrument resolution width W and a rate con- 
stant characterizing the motions of H in the protein. The 
method is applied to existing data in the literature to ob- 
tain the intrinsic MSD (r 2 ) of heparan sulphate (HS-0.4), 
Ribonuclease A and Staphysloccal Nuclase (SNase). 

The intrinsic MSD is defined as the infinite time (r 2 ) = 
(r 2 (t = oo)) that appears in the Debye- Waller factor. 
The intrinsic (r 2 ) is always greater than the apparent, 
resolution dependent (r 2 ) exp . From fits to data at dif- 
ferent temperatures, an intrinsic (r 2 ) as a function of 
temperature is obtained. This intrinsic (r 2 ) shows a 
dynamical transition. The intrinsic dynamical transition 
temperature, Td, is defined as temperature at which the 
intrinsic MSD begins to increase markedly with temper- 
ature. The intrinsic Td is always less than the apparent, 
resolution dependent Td- 

The essential ingredients of a model needed to obtain 
(r 2 ) are (1) a definition of (r 2 ), (2) a characteristic time 
(or times) of the motions that contribute to (r 2 ) and (3) 
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the instrument resolution width. The present model is 
simple and illustrative only and can be much improved. 
At the same time, much current data in the literature 
may not be sufficiently precise to distinguish between 
simple and more sophisticated models. 
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VII. APPENDIX 

The purpose of this appendix is to derive the expres- 
sion 

7 oc =exp(-io 2 (r 2 )), (19) 

to identify the approximations made to obtain it and to 
clarify the meaning of (r 2 ). loo is the t — > oo limit of the 
intermediate incoherent DSF, I(Q,t) defined in Eq. (J7J. 
I(Q, oo) is independent of time t. The Fourier trans- 
form of /oo is therefore purely elastic (zero except at lj 
— 0) and given by S e i(Q,u) — Ioo5(l)). This shows that 
loo arises from structure in the protein since translation- 
ally invariant systems such as gases and liquids that have 
no structure have no elastic scattering. Thus the motions 
that contribute to loo and (r 2 ) are therefore vibrations, 
hindered rotations, restricted diffusion and all motions 
that in some way are restricted by or reflect a structure. 
loo is exactly the Debye- Waller factor that appears in 
the scattering of X-rays or neutrons from crystals [30U32j |. 
In crystalline solids, loo is the reduction in intensity of 
a Bragg peak arising from atomic vibration in the solid. 
For H in proteins, loo is the reduction in intensity of the 
incoherent elastic scattering at any Q value arising from 
all restricted motions of the H. 

To obtain loo , we begin with I inc (Q, t) given by Eq. ([6]) . 
The first approximation is to reduce Eq. ^ to Eq. ||7J). 
If Eq. ^ were retained then terms beyond the Gaus- 
sian in Eq. JTSJ will be obtained [26l-|28}. Following the 
arguments below Eq. (fT4|) . the t — > oo limit of Eq. ([7]) is, 

loo = I{Q,t — oo) = (exp(— iQ.r(oo)) exp(iQ.r(0))) 
= (expHQ.r(0)))(exp(iQ.r(0))). (20) 



To arrive at the final line of Eq. ([20]) we have assumed: 
(1) that the position, r(oo) at t — > oo of each H is uncor- 
rected with its position r(0) at t = so the expectation 
values of r(oo) and r(0) are independent and (2) that the 
protein properties are independent of the time when they 
are observed so that (r n (oo)) = (r™(0)) = (r n ). loo is 
the product of two identical exponentials, one containing 
i and the other —i. 

To proceed we make a cumulant expansion of each ex- 
pectation value in loo , 

°° (i) n 

(exp(ix)) = exp[V — -/i„] (21) 
^-^ n\ 

where the \i n are cumulants and x — ±(Q.r). The cumu- 
lant s combination of moments, 

Mi = {%) 

M2 = (x 2 ) (x) 2 

^3 = {x 3 )-3(x 2 )(x)+2(x) 3 

Hi = (x 4 ) - 3(x 2 ) 2 - A(x 3 )(x) + 12(x 2 )(x) 2 - 6(x) 4 . 

(22) 

Because of the ±z in the exponentials in Eq. (I2U1) , the odd 
cumulants cancel. In loo only even cumulants appear. 
Thus, 

°2, (i) n 

loo = (exp(-iQ.r))(exp(iQ.r)) = exp[2 } — -fj, n ] 

n even 

1 2 

= exp[-^i 2 + y^^4 - ^yMe + •■•] (23) 

We also take (r) =0. Thus up to fourth order, 

loo - exp[-([Q.r] 2 } + ^[([Q.rf) - 3([Q.r] 2 ) 2 ] (24) 

On neglecting the fourth order cumulant and assuming 
cubic symmetry so that {[Q-r] 2 ) — Q 2 (r 2 } = ^Q 2 (r 2 }, 
where the z axis is chosen parallel to Q, we obtain loo in 
Eq. p^|) . The fourth order cumulant will be small if Q 
is small or if the motional distribution of the H atom is 
approximately Gaussian. 

Eq. (fTT)]) defines the intrinsic (r 2 ). This (r 2 ) is the 
same as that defined in simulations as the t — > oo limit 
of \{[r{t) - r(0)] 2 ) since (r(i)r(O)) = at t -> oo and 
(r 2 (oo)) = (r 2 (0)) = (r 2 ). Explicitly, the intrinsic value 
of (r 2 ) is defined as the value that appears in the Debye- 
Waller factor loo- 
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